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Abstract — The idea of a Potential Energy Surface (PES) forms 
the basis of almost all accounts of the mechanisms of chemical 
reactions, and much of theoretical molecular spectroscopy. It is 
assumed that, in principle, the PES can be calculated by means 
of clamped-nuclei electronic structure calculations based upon 
the Schrodinger Coulomb Hamiltonian. This article is devoted 
to a discussion of the origin of the idea, its development in the 
context of the Old Quantum Theory, and its present status in 
the quantum mechanics of molecules. It is argued that its present 
status must be regarded as uncertain. 
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I. Introduction 

The Coulombic Hamiltonian H' does not provide 
much obvious information or guidance, since there 
is [sic] no specific assignments of the electrons oc- 
curring in the systems to the atomic nuclei involved 
- hence there are no atoms, isomers, conformations 
etc. In particular one sees no molecular symmetry, 
and one may even wonder where it comes from. 
Still it is evident that all of this information must be 
contained somehow in the Coulombic Hamiltonian 
H' [1]. 

Per-Olov Lowdin, Pure. Appl. Chem. 61, p.2071, (1989) 

This paper addresses the question Lowdin wondered about 
in terms of what quantum mechanics has to says about 
molecules. A conventional chemical description of a stable 
molecule is a collection of atoms held in a semi-rigid arrange- 
ment by chemical bonds, which is summarized as a molecular 
structure. Whatever 'chemical bonds' might be physically, 
. it is natural to interpret this statement in terms of bonding 
forces which are conservative. Hence a stable molecule can 
be associated with a potential energy function that has a 
minimum value below the energy of all the clusters that the 
molecule can be decomposed into. Finding out about these 
forces, or equivalently the associated potential energy, has 
been a major activity for the past century. There is no a priori 
specification of atomic interactions from basic physical laws 
so the approach has been necessarily indirect. 

After the discovery of the electron [2] and the triumph of 
the atomic, mechanistic view of the constitution of matter, 
it became universally accepted that any specific molecule 
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consists of a certain number of electrons and nuclei in ac- 
cordance with its chemical formula. This can be translated 
into a microscopic model of point charged particles interacting 
through Coulomb's law with non-relativistic kinematics. These 
assumptions fix the molecular Hamiltonian as precisely what 
Lowdin referred to as the 'Coulombic Hamiltonian', 

" n 2 " pp- 

H = E5t + E ,~ . -■ a) 
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where the n particles are described by empirical charge and 
mass parameters {e,-, m; 1 = 1 , . . . n}, and Hamiltonian canon- 
ical variables {q,-,p;, i = 1 ...«}, which after quantization are 
regarded as non-commuting operators. 

As is well-known classical dynamics based on equation 
(1) fails completely to account for the stability of atoms 
and molecules, as evidenced through the facts of chemistry 
and spectroscopy. And so, starting about a century ago, there 
was a progressive modification of dynamics as applied to 
the microscopic world from classical ('rational') mechanics, 
through the years of the Old Quantum Theory until finally 
quantum mechanics was defined. This slow evolution left its 
mark on the development of molecular theory in as much that 
classical ideas survive in modern Quantum Chemistry. In the 
following sections we review some aspects of this progression; 
we also emphasize that a direct approach to a quantum theory 
of a molecule can be based on the quantized version of (1), 
simply as an extension of the highly successful quantum theory 
of the atom. 

It is of interest to compare this so-called 'Isolated Molecule' 
model with the conventional account; after all, the sentiment of 
the quotation from Lowdin reflects the widespread view that 
the model is the fundamental basis of Quantum Chemistry. 
Even though there are no closed solutions for molecules, it is 
certainly possible to characterize important qualitative features 
of the solutions for the model because they are determined by 
the form of the defining equations [1], [3], [4]. One of the most 
important ideas in molecular theory is the Potential Energy 
Surface for a molecule; this is basic for theories of chemical 
reaction rates and for molecular spectroscopy. In §11 we discuss 
some aspects of its classical origins. Then in §111 we revisit 
the same topics from the standpoint of quantum mechanics, 
where we will see that if we eschew the conventional classical 
input (classical fixed nuclei), there are no Potential Energy 
Surfaces in the solutions derived from (1). It is not the case that 
the conventional approach via the clamped-nuclei Hamiltonian 
is merely a convenience that permits practical calculation (in 
modern terms, computation) with results concordant with the 



underlying Isolated Molecule model that would be obtained 
if only the computations could be done. On the contrary, 
a qualitative modification of the formalism is imposed by 
hand. The paper concludes in §IV with a discussion of these 
results; some relevant mathematical results are illustrated in 
the Appendix. 

We wish to emphasize that the paper is about a difficult 
technical problem; it is not a contribution to the philosophy 
of science. In the traditional picture, equation (15) is widely 
held to be exact in principle, so if the adiabatic approximation 
is found to be inadequate we would expect to do 'better' by 
including coupling terms. Our analysis implies that belief is 
not well founded because (15) is not well founded a priori 
in quantum mechanics; it requires an extra ingredient put in 
by hand. It might work, or it might not; in other words it is 
not a sure-fire route to a better account. While we can't offer 
a better alternative, that information is surely important for 
chemical physics. 

II. Classical origins 

The idea of a Potential Energy Surface can be glimpsed in 
the beginnings of chemical reaction rate theory that go beyond 
the purely thermodynamic considerations of van 't Hoff and 
Duhem more than a century ago, and in the first attempts to 
understand molecular ('band') spectra in dynamical terms in 
the same period. Thereafter progress was rapid as the newly 
emerging ideas of a 'quantum theory' were developed; by 
the time that quantum mechanics was finalized (1925/6) ideas 
about the separability of electronic and nuclear motions in 
molecules were common currency, and were carried forward 
into the new era. In this section we describe how this devel- 
opment took place. 

A. Rates of chemical reactions - Rene Marcelin 

The idea of basing a theory of chemical reactions (chem- 
ical dynamics) on an energy function that varies with the 
configurations of the participating molecules seems to be 
due to Marcelin. In his last published work, his thesis, [5], 
Marcelin showed how the Boltzmann distribution for a system 
in thermal equilibrium and statistical mechanics can be used to 
describe the rate, v, of a chemical reaction. The same work was 
republished in the Annales de Physique [6] shortly after his 
death 1 . The main conclusions of the thesis were summarized 
in two short notes published in Comptes Rendus in early 1914 
[7], [8]. His fundamental result can be expressed, in modern 
terms, as 

v = M(e- AG ^ RT - e- AG -/ RT ) (2) 

where R is the molar gas constant, T is the temperature in 
Kelvin, the subscripts +,— refer to the forward and reverse 
reactions, and AG # is the change in the molar Gibbs (free) 
energy in going from the initial (+) or final (-) state to the 
'activated state'. The pre-exponential factor M is obtained 
formally from statistical mechanics. Marcelin gave several 

'Rene Marcelin was killed in action righting for France in September 
1914. 



derivations of this result using both thermodynamic argu- 
ments and also the statistical mechanics he had learnt from 
Gibbs's famous memoir [9]. It is perhaps worth remarking 
that Gibbs saw statistical mechanics as the completion of 
Newtonian mechanics through its extension to conservative 
systems with an arbitrarily large, though finite, number of 
degrees of freedom. The laws of thermodynamics could easily 
be obtained from the principles of statistical mechanics, of 
which they were the incomplete expression, but Gibbs did not 
require thermodynamic systems to be made up of molecules; 
he explicitly did not wish his account of rational mechanics to 
be based on hypotheses concerning the constitution of matter, 
which at the time were still controversial [10]. 

^From our point of view the most interesting aspect of 
Marcelin's account is the suggestion that molecules can have 
more degrees of freedom than those of simple point material 
particles. In this perspective, a molecule can be assigned a 
set of Lagrangian coordinates q = q\,q2, ■■■ ,q n , an d their 
corresponding canonical momenta p = pi,P2,---Pn- Then 
the instantaneous state of the molecule is associated with 
a 'representative' point in the canonical phase-space & of 
dimension In, and so "as the position, speed or structure of the 
molecule changes, its representative point traces a trajectory 
in the 2n-dimensional phase-space" [5]. 

In his phase-space representation of a chemical reaction the 
transformation of reactant molecules into product molecules 
was viewed in terms of the passage of a set of trajectories 
associated with the 'active' molecules through a 'critical 
surface' 5? in that divides & into two parts, one part being 
associated with the reactants, the other with the products. Such 
a [hyper] surface is defined by a relation 

S(q,p) = 

According to Marcelin, for passage through this surface it is 
required 2 [5] 

[une molecule] il faudra [....] qu'elle atteigne une 
certaine region de l'espace sous une obliquite con- 
venable, que sa vitesse depasse une certain limite, 
que sa structure interne corresponde a une configu- 
ration instable, etc.; .... 
In modern notation, the volume of a cell in the 2n- 
dimensional phase-space is 

dW = dq dp 

The number of points in dCJ is given by the Gibbs distribution 
function / 

dv = /(q,p,0 dm (3) 

Marcelin chose the distribution function for the active 
molecules as 

/(q,p,0 - e-^l RT e-^y^ T (4) 

where kg is Boltzmann's constant, Jt? is the Hamiltonian 
function for the molecule, and G* is the Gibb's free energy of 

2 that a molecule must reach a certain region of space at a suitable angle, 
that its speed must exceed a certain limit, that its internal structure must 
correspond to an unstable configuration etc; .... 
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the active molecules relative to the mean energy of the reactant 
molecules. It is independent of the canonical variables. There 
is an analogous expression for the reverse reaction involving 
G # . Marcelin quoted a formula due to Gibbs [9] for the 
number of molecules dN crossing a surface element ds in 
the critical surface in the neighbourhood of q,p, in time dt, 
which may be written in shorthand as 

dN = dt /(q,p,0 7(q,p,q,p) 

where p,q are regarded as functions of q,p by virtue of 
Hamilton's equations of motion. The total rate is 

v = jdm /(q,p) / 5[5(q,p)] (5) 

where the delta function confines the integration to the critical 
surface 5? . Equation (2) results from taking the difference 
between this expression for the forward and reverse reactions, 
and factoring out the terms in G* ; the remaining integration, 
which Marcelin did not evaluate, defines the multiplying factor 
M. 

B. Molecular spectroscopy and the Old Quantum Theory 

Although the discussion in the previous section looks fa- 
miliar, it does so only because of the modern interpretation 
we put upon it 3 . It is important to note that nowhere did 
Marcelin elaborate on how the canonical variables were to be 
chosen, nor even how n could be fixed in any given case. The 
words 'atom', 'electron', 'nucleus' do not appear anywhere 
in his thesis, in which respect he seems to have followed the 
scientific philosophy of his countryman Duhem [11]. On other 
pages in the thesis Marcelin refered to the 'structure' (also 
'architecture') of a molecule and to molecular 'oscillations' 
but never otherwise invoked the atomic structural conception 
of a molecule due to e.g. van 't Hoff, although he was very 
well aware of van 't Hoff's Physical Chemistry. 

Contemporary with Marcelin's investigation of chemical 
reaction rates was the introduction of a completely novel 
model of an atom due to Rutherford. However it quickly 
became apparent that Rutherford's solar system model of the 
atom (planetary electrons moving about a central nucleus) 
cannot avoid eventual collapse if classical electrodynamics 
applies to it. This is because of Earnshaw's theorem which 
states that it is impossible for a collection of charged particles 
to maintain a static equilibrium purely through electrostatic 
forces [16]. This is the classical result that Bohr alluded to 
in his 1922 Nobel lecture [17] to rule out an electrostatical 
explanation for the stability of atoms and molecules. 

The theorem may be proved by demonstrating a contradic- 
tion. Suppose the charges are at rest and consider the motion 

3 Nevertheless it seems proper to regard Marcelin's introduction of phase- 
space variables and a critical reaction surface into chemical dynamics as the 
beginning of a formulation of the Transition State Theory that was developed 
by Wigner in the 1930's [12] - [15]. The In phase-space variables q,p 
were identified with the n nuclei specified in the chemical formula of the 
participating species, and the Hamiltonian M" was that for classical nuclear 
motion on a Potential Energy Surface; this dynamics was assumed to give rise 
to a critical surface which was such that reaction trajectories cross the surface 
precisely once. The classical nature of the formalism was quite clear because 
the Uncertainty Principle precludes the precise specification of position on 
the critical surface simultaneously with the momentum of the nuclei. 



of a particular charge e n in the electric field, E, generated by 
all of the other charged particles. Assume that this particular 
charge has e n > 0. The equilibrium position of this particle 
is the point where E(x^) = 0, since the force on the charge 
is e„E(x„) (the Lorentz force for this static case). Obviously, 
x° cannot be the equilibrium position of any other particle. 
However, in order for x° to be a stable equilibrium point, the 
particle must experience a restoring force when it is displaced 
from x° in any direction. For a positively charged particle at 
x^, this requires that the electric field points radially towards 
xjj at all neighbouring points. But from Gauss's law applied 
to a small sphere centred on xjj, this corresponds to a negative 
flux of E through the surface of the sphere, implying the 
presence of a negative charge at x°, contrary to our original 
assumption. Thus E cannot point radially towards x° at all 
neighbouring points, that is, there must be some neighbouring 
points at which E is directed away from xj). Hence, a positively 
charged particle placed at xj) will always move towards such 
points. There is therefore no static equilibrium configuration. 
According to classical electrodynamics accelerated charges 
must radiate electromagnetic energy, and hence lose kinetic 
energy, so even a dynamical model cannot be stable according 
to purely classical theory. 

Molecular models which can be represented in terms of 
the (phase-space) variables of classical dynamics had a far- 
reaching influence on the interpretation of molecular spectra 
after the dissemination of Bohr's quantum theory of atoms 
and molecules based on transitions between stationary states 
[18]. An important feature of his new theory was that classical 
electrodynamics should be deemed to be still operative when 
transitions took place, but not when the system was in a 
stationary state, by fiat. Bohr had originally used the fact that 
two particles with Coulombic interaction lead to a Hamiltonian 
problem that is completely soluble by separation of variables. 
With more particles and Coulombic interactions this is no 
longer true; however by largely qualitative reasoning he was 
able to develop a quantum theory of the atom and the Periodic 
Table (reviewed in [17]). Furthermore by the introduction of 
Planck's constant h through the angular momentum quantiza- 
tion condition, Bohr solved another problem of the classical 
theory. In classical electrodynamics the only characteristic 
length available is the classical radius r„ for a charged particle. 
This is obtained by equating the rest-mass energy for the 
charge to the electrostatic energy of a charged sphere of radius 

( e 2 \ 
\ Ane^mc 2 ) 

For an electron this yields r Q rts 2.8 x 10~ 15 m and an even 
smaller value for any nucleus. It was clear that this was far 
too small to be relevant to an atomic theory; of course the Bohr 
radius a « 0.5 x 10~ 10 m is of just the right dimension. 

Bohr's theory developed into the Old Quantum Theory 
which was based on a phase-space description of an atomic- 
molecular system and theoretical techniques originally de- 
veloped in celestial mechanics. These came from the ap- 
plication of the developing quantum theory to molecular 
band spectra by Schwarzschild [19] and Heurlinger [20] who 
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used it to describe the quantized vibrational and rotational 
energies of small molecules (diatomic and symmetric top 
structures). Schwarzschild, an astrophysicist, was responsible 
for the introduction of action-angle methods as a basis for 
quantization in atomic/molecular theory. Heurlinger assumed 
a quantization of the energy of the nuclear vibration analogous 
to that used by Planck for his ideal linear oscillators, with 
the possibility of anharmonic behaviour. Thus a force-law or 
potential energy depending on the separation of the nuclei, for 
a given arrangement of the electrons, was required. 

The basic calculational tool was a perturbation theory 
approach developed enthusiastically by Born [21] and Som- 
merfeld [22] with their research assistants. The solution of the 
Hamiltonian equations of motion could be attempted via the 
Hamilton-Jacobi method based on canonical transformations 
of the action-angle variables. This leads to an expression for 
the energy that is a function of the action integrals only. The 
action (or 'phase') integrals are constants of the motion, and 
are also adiabatic invariants [23], and as such are natural 
objects for quantization according to the 'quantum conditions' . 
Thus for a separable system with k degrees of freedom and 
action integrals {/;, i = 1 ...j < k}, the quantum conditions 
according to Sommerfeld are 



/; = jpi dqi = nth, i = 



(6) 



where the n, are non-negative integers (j < k in case of 
degeneracy). Here it is assumed that each pi is a periodic 
function of only its corresponding conjugate coordinate g,, 
and the integration is taken over a period of g,. An important 
principle, due to Bohr, was that slow, continuous ('adiabatic') 
deformations of an atomic system kept the system in a 
stationary state [24], [25]. Thus the action integrals for a 
Hamiltonian depending on parameters that vary slowly in 
time are conserved under slow changes of the parameters 4 . 
This could be applied to the problem of chemical bonding by 
treating the nuclear positions as the slowly varying parameters 
in an adiabatic transformation of the Hamiltonian for the 
electrons in the presence of the nuclei. 

We now know that systems of more than 2 particles with 
Coulomb interactions may have very complicated dynamics; 
Newton famously struggled to account quantitatively for the 
orbit of the moon in the earth-moon-sun problem (n = 3). 
The underlying reason for his difficulties is the existence of 
solutions carrying the signature of chaos [27] and this implies 
that there are classical trajectories to which the quantum con- 
ditions simply cannot be applied 5 because the integrals in (6) 
do not exist [28]. We also know that the r _1 singularity in the 
classical potential energy can lead to pathological dynamics in 
which a particle is neither confined to a bounded region, nor 
escapes to infinity for good. If the two-body interaction V(r) 
has a Fourier transform v(k) the total potential energy can be 



4 This is strictly true only for integrable Hamiltonians [26]. 

5 The difficulties for action-angle quantization posed by the existence of 
chaotic motions in non-separable systems [30] were recognized by Einstein 
at the time the Old Quantum Theory was developed [31]. 



expressed as 

U = j^etej V(\xi-Xj\ 



= -£v ( o) 



(2*) 



i e 



ik.Xj 



In the case of the Coulomb interaction v(k) = 4-n/k 2 > 
and so the potential energy U is bounded from below by 
— nV(0)/2; unfortunately for point charges as 0,V(r) — > 
±°° and collapse may ensue [29]. 

Attempts were made by Born and his assistants to dis- 
cuss the stationary state energy levels of 'simple' non-trivial 
systems such as He, H^, H2, H2O. The molecular species 
were tackled as problems in electronic structure, that is, as 
requiring the calculation of the energy levels for the electron(s) 
in the field of fixed nuclei as a calculation separate from 
the rotation-vibration of the molecule as a whole. Pauli gave 
a lengthy qualitative discussion of the possible Bohr orbits 
for the single electron moving in the field of two fixed 
protons in H \ but could not obtain the correct stationary states 
[32]. Nordheim investigated the forces between two hydrogen 
atoms as they approach each other adiabatically 6 in various 
orientations consistent with the quantum conditions. Before 
the atoms get close enough for the attractive and repulsive 
forces to balance out, a sudden discontinuous change in the 
electron orbits takes place and the electrons cease to revolve 
solely round their parent nuclei. Nordheim was unable to find 
an interatomic distance at which the energy of the combined 
system was less than that of the separated atoms; this led to the 
conclusion that the use of classical mechanics to discuss the 
stationary states of the molecular electrons had broken down 
comprehensively [33], [34]. This negative result was true of all 
the molecular calculations attempted within the Old Quantum 
Theory framework which was simply incapable of accounting 
for covalent bonding [35]. 

The most ambitious application of the Old Quantum Theory 
to molecular theory was made by Born and Heisenberg [36]. 
They started from the usual non-relativistic Hamiltonian (1) 
for a system comprised of n electrons and N nuclei interacting 
via Coulombic forces. They assumed there is an arrangement 
of the nuclei which is a stable equilibrium, and use that 
(a molecular structure) as a reference configuration for the 
calculation. Formally the rotational motion of the system can 
be dealt with by requiring the coordinates for the reference 
structure to satisfy 7 what were later to become known as 'the 
Eckart conditions' [37]. Then with a suitable set of internal 
variables and 



(-) 



as the expansion parameter, the Hamiltonian was expressed as 
a series 



H 



l 2 H 2 



(7) 



to be treated by the action-angle perturbation theory Born 

6 This is the earliest reference we know of where the idea of adiabatic 
separation of the electrons and the nuclei is proposed explicitly. 

7 This also deals with the uninteresting overall translation of the molecule. 
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had developed. The 'unperturbed' Hamiltonian H is the full 
Hamiltonian for the electrons with the nuclei fixed at the 
equilibrium structure, H2 is quadratic in the nuclear variables 
(harmonic oscillators) and also contains the rotational energy 8 , 
while . . . stands for higher order anharmonic vibrational terms. 
Hi may be dropped because of the equilibrium condition. 
With considerable effort there follows the usual separation of 
molecular energies, although of course no concrete calculation 
was possible within the Old Quantum Theory framework. 
It is noteworthy that their calculation gives the electronic 
energies at a single configuration because the perturbation 
calculation requires the introduction of the (assumed) equilib- 
rium structure. This is different from the adiabatic approach 
Nordheim tried (unsuccessfully) to get the electronic energy 
at any separation of the nuclei [33]. 

III. Quantum Theory 

With the completion of quantum mechanics in 1925-26, 
the old problems in atomic and molecular theory were re- 
considered and considerable success was achieved. The idea 
that the dynamics of the electrons and the nuclei should be 
treated to some extent as separate problems was generally 
accepted. Thus the electronic structure calculations of London 
[39] - [41] can be seen as a successful reformulation of the 
approach Nordheim had tried in terms of the older quantum 
theory, and the idea of 'adiabatic separation' is often said 
to originate in this work. It is however also implied in the 
closing section of Slater's early He atom paper where he 
sketches (but does not carry through) a perturbation method of 
approximate calculation for molecules in which the nuclei are 
first held fixed, and the resulting electronic eigenvalue(s) then 
act as the potential energy for the nuclei [42]. A quantum 
mechanical proof of Ehrenfest's adiabatic theorem for time- 
dependent perturbations was given by Born and Fock [43]. 
Most famously though, the quantum mechanical basis for the 
idea of electronic Potential Energy Surfaces is commonly at- 
tributed to Born and Oppenheimer, and it is to a consideration 
of their famous paper [38] that we now turn. 

A. Born and Oppenheimer' s Quantum Theory of Molecules 

Much of the groundwork for Born and Oppenheimer's 
treatment of the energy levels of molecules was laid down 
in the earlier attempt by Born and Heisenberg [36]. The 
basic idea of both calculations is that the low-lying excita- 
tion spectrum of a molecule can be obtained by regarding 
the nuclear kinetic energy as a 'small' perturbation of the 
energy of the electrons for stationary nuclei in an equilibrium 
configuration. The physical basis for the idea is the large 
disparity between the mass of the electron and the masses of 
the nuclei; classically the light electrons undergo motions on a 
'fast' timescale (x e w 10~ 16 — > 10~ 15 s), while the vibration- 
rotation dynamics of the much heavier nuclei are characterized 
by 'slow' timescales (% w 1CT 14 -> lCT 12 s). 

8 The rotational and vibrational energies occur together because of the 
choice of the parameter A; as is well-known, Born and Oppenheimer later 
showed that a better choice is to take the quarter power of the mass ratio 
as this separates the vibrational and rotational energies in the orders of the 
perturbation expansion [38]. 



Consider a system of electrons and nuclei and denote 
the properties of the former by lower-case letters (mass m, 
coordinates x, momenta p) and of the latter by capital letters 
(mass M, coordinates X, momenta P). The small parameter 
for the perturbation expansion must clearly be some power 
of m/M , where M Q can be taken as any one of the nuclear 
masses or their average. In contrast to the earlier calculation 
they found the correct choice is 



K = 



m 



rather than Born and Heisenberg's X — K 2 . In an obvious 
shorthand notation using a coordinate representation the ki- 
netic energy of the electrons is then 9 



while the nuclear kinetic energy depends on K 



T N = 



dX 



The Coulomb energy is simply U(x,X). They then define the 
'unperturbed' Hamiltonian 



(8) 



T e + U = H ^x,—,X 

and express the total Hamiltonian as 

H = H + K 4 H\ 
with Schrodinger equation 



H E )y(x,X) = 0. 



(9) 



(10) 



At this point in their argument they state 
Setzt man in (12) [equation(lO) above] K 
bekommt man eine Differentialgleichung fiir die xt 
allein, in der die X[ als Parameter vorkommen: 



0, so 



H, 



d 

x,—;X 
ox 



W}\j/ = 0. 



Sie stellt offenbar die Bewegung der Elektronen bei 

festgehaltenen Kernen dar 10 . 
This splitting of the Hamiltonian into an 'unperturbed' part 
(k = 0) and a 'perturbation' is essentially the same as in the 
earlier Old Quantum Theory version [36]. The difference here 
is that the action-angle perturbation theory of the Old Quantum 
Theory is replaced by Schrodinger's quantum mechanical 
perturbation theory. In the following it is understood that 
the overall translational motion of the molecule has been 
separated off by a suitable coordinate transformation; this is 
always possible. The initial step in setting up the perturbation 
expansion involves rewriting the Hamiltonian H as a series in 
increasing powers of K. This is achieved by introducing new 

9 The details can be found in the original paper [38], and in various 
English language presentations, for example [44] - [46]. 

10 If one sets K = O....one obtains a differential equation in the xu alone, the 

Xi appearing as parameters: Evidently, this represents the electronic motion 

for stationary nuclei. 
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relative coordinates that depend on K 

X = X + rcC (11) 

for some fixed X , and using the {£} as the nuclear variables, 
in an expansion of H about X . 

Then as usual the eigenfunction and eigenvalue of (10) are 
presented as series in K 

y/ = y/ (0 ) + k y/ 1 ' + k 2 y/ 2 ' + • • • 
E = + + k 2 E^ +..., 

the expansions are substituted into the Schrodinger equation 
(10), and the terms separated by powers of K. This gives a set 
of equations to be solved sequentially. The crucial observation 
that makes the calculation successful is the choice of X ; the 
Schrodinger equation for the unperturbed Hamiltonian H a can 
be solved for any choice of the nuclear parameters X, and 
yields 11 an unperturbed energy E(X) for the configuration X. 
For the consistency of the whole scheme however it turns out 
(cf footnote 9) that X in equation (11) cannot be arbitrarily 
chosen, but must correspond to a minimum of the electronic 
energy. That there is such a point is assumed to be self-evident 
for the case of a stable molecule. The result of the calculation 
was a triumph; the low-lying energy levels of a stable molecule 
can be written in the form 

£ Mol = £ Elec + 1(2 E Vib + K ^ %ot +••• ( 12 ) 

in agreement with a considerable body of spectroscopic ev- 
idence. The eigenfunctions that correspond to these energy 
levels are simple products of an electronic wavefunction 
obtained for the equilibrium geometry and suitable vibration- 
rotation wavefunctions for the nuclei. 

The Born and Heisenberg calculation [36] had been per- 
formed while Heisenberg was a student with Born; Kragh [35] 
quotes Heisenberg's later view of it in the following terms 
As an exasperated Heisenberg wrote to Pauli, "The 

work on molecules I did with Born contains 

bracket symbols [Klammersymbole] with up to 8 
indices and will probably be read by no one." 
Certainly, it was not read by the chemists. 
Curiously that may have initially been the fate of Born and 
Oppenheimer's paper. As noted by one of us many years ago, a 
survey of the literature up to about 1935 shows that the paper 
was hardly if ever mentioned, and when it was mentioned, 
its arguments were used as a posteriori justification for what 
was being done anyway [47]. What was being done was the 
general use in molecular spectroscopy and chemical reaction 
theory of the idea of Potential Energy Surfaces on which 
the nuclei moved. As we have seen, that idea is not to be 
found in the approach taken by Born and Oppenheimer which 
used (and had to use) a single privileged point in the nuclear 
configuration space - the assumed equilibrium arrangement of 
the nuclei [38]. 

In 1935 a significant event was the publication of the famous 
textbook Introduction to Quantum Mechanics [48] which was 
probably the first textbook concerned with quantum mechanics 

n W(X) in the notation of the above quotation. 



that addressed in detail problems of interest to chemists. 
Generations of chemists and physicists took their first steps 
in quantum theory with this book, which is still available in 
reprint form. Chapter X of the book is entitled The Rotation 
and Vibration of Molecules; it starts by summarizing the em- 
pirical results of molecular spectroscopy which are consistent 
with (12). The authors then turn to the wave equation for a 
general collection of electrons and nuclei and remark that its 
Schrodinger wave equation may be solved approximately by 
a procedure that they attribute to Born and Oppenheimer; first 
solve the wave equation for the electrons alone, with the nuclei 
in a fixed configuration, and then solve the wave equation 
for the nuclei alone, in which a characteristic energy value 
[eigenvalue] of the electronic wave equation, regarded as a 
function of the internuclear distances, occurs as a potential 
function. After some remarks about the coordinates they say 
The first step in the treatment of a molecule is to 
solve this electronic wave equation for all configura- 
tions of the nuclei. It is found that the characteristic 
values U n (%) of the electronic energy are continuous 
functions of the nuclear coordinates E, . For example, 
for a free diatomic molecule the electronic energy 
function for the most stable electronic state (n = 0) 
is a function only of the distance r between the two 
nuclei, and it is a continuous function of r, such as 
shown in Figure 34-2. 
Figure 34-2 referred to here is a Morse potential function. 
Later in the book where they give a brief introduction to 
activation energies of chemical reactions they explicitly cite 
London [41] as the origin of the idea of adiabatic nuclear 
motion on a Potential Energy Surface, though there is also 
a nod back towards Chapter X. Although it is now almost 
universal practice to refer to treating the nuclei as classical par- 
ticles that give rise to an electronic energy surface as 'making 
the Born-Oppenheimer approximation' it is our opinion that 
the justification for such a strategy is not to be found in The 
Quantum Theory of Molecules, [38]. Nor is it to be found in 
the early papers of London [39] - [41] where it was simply 
assumed as a reasonable thing to do. And it is certainly the 
case that Born and Oppenheimer did not show the electronic 
energy to be a continuous function of the nuclear coordinates; 
that was first demonstrated for a diatomic molecule forty years 
after Pauling and Wilson's book was published (see §III-D). 

B. Born and the elimination of electronic motion 

Many years after his work with Heisenberg and Oppen- 
heimer, Born returned to the subject of molecular quantum 
theory and developed a different account of the separation 
of electronic and nuclear motion [44], [49]. It is to this 
method that the expression 'Born-Oppenheimer approxima- 
tion' usually refers in modern work. Consider the unper- 
turbed electronic Hamiltonian H a {x,Xf) at a fixed nuclear 
configuration Xf that corresponds to some molecular structure 
(not necessarily an equilibrium structure). The Schrodinger 
equation for H is 

(H (x,X f ) - E°(X f ) m ^(p{x,X f ) m = 0. (13) 
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This equation can have both bound-state and continuum eigen- 
functions; the bound-state eigenvalues considered as functions 
of the Xf are the molecular Potential Energy Surfaces. Born 
proposed to solve the full molecular Schrodinger equation, 
(10) by an expansion 



W{x,X) = (X) m <p(x,X)„ 



(14) 



with coefficients {<P(X) m } that play the role of nuclear wave- 
functions. As in the original calculation (§III-A) a crucial step 
is to assign the nuclear coordinates the role of parameters in 
the Schrodinger equation (13) for the electronic Hamiltonian; 
it differs from the earlier approach of Born and Oppenheimer 
because now the values of Xf range over the whole nuclear 
configuration space. Substituting this expansion into (10), 
multiplying the result by (p(x,X)* and integrating over the 
electronic coordinates x leads to an infinite dimensional system 
of coupled equations for the nuclear functions {4>}, 

(T N + E"{X) n - E)<S>(X) n + Y,C{X,P)nn'HX)n> = 

nn' 

(15) 

where the coupling coefficients {C(X ,P) nn i} have a well- 
known form which we need not record here [44]. 

In this formulation the adiabatic approximation consists 
of retaining only the diagonal terms in the coupling matrix 
C(X,P), for then a state function can be written as 



yr(x,X) 



<p(x,X)„ *(*)„. (16) 



and a product wavefunction corresponds to additive electronic 
and nuclear energies. The special character of the electronic 
wavefunctions {q>(x,X) m } is, by (13), that they diagonalize 
the electronic Hamiltonian H ; they are said to define an 
'adiabatic' basis (cf the approximate form (16)) because the 
electronic state label n is not altered as X varies. The Born 
approach does not really require the diagonalization of H ; 
it is perfectly possible to define other representations of the 
electronic expansion functions through unitary transformations 
of the {<p}, with concomitant modification of the coupling 
matrix C. This leads to so-called 'diabatic' bases; the freedom 
to choose the representation is very important in practical 
applications to spectroscopy and atomic/molecular collisions 
[50], [51]. 

C. Formal quantum theory of the molecular Hamiltonian 

We now start again and develop the quantum theory of 
the Hamiltonian for a collection of n charged particles with 
Coulombic interactions 12 . We remind ourselves again from §1 
that for particles with classical Hamiltonian variables {q;,P;} 
this is 

yPi_ ^ e t e, 
t?2m,: 



H 



y. 



(17) 



with the non-zero Poisson-bracket 



12 The reader may find it helpful to refer to the Appendix which summa- 
rizes some mathematical notions that are needed here, and illustrates them in 
a simple model of coupled oscillators with two degrees of freedom. 



Let us denote the classical dynamical variables for the elec- 
trons collectively as x,p, and those for the nuclei by X,P 
and denote the classical Hamiltonian by H(x,p,X,P). After 
the customary canonical quantization these variables become 
time-independent operators in a Schrodinger representation 

x — > x etc. 

In the following it will be important to distinguish between 
operators and c-numbers, so in the following we will use 
the x notation for operators, and make no special choice of 
representation. 

As we have seen, the idea that the kinetic energy of the 
massive nuclei could be treated as a perturbation of the 
electronic motion was first formulated in the framework of the 
Old Quantum Theory. The idea was to separate the classical 
Hamiltonian H into two parts to isolate the nuclear momentum 
variables 



H(x,p,X,P) = //„(x,p,X) + »c 4 //i(P). 



(18) 



According to Hamilton's equations for the unperturbed prob- 

lem dX 

— = {X,H } = 0, (19) 

using Poisson-bracket notation, which was interpreted (cor- 
rectly) as describing the dynamics of the electrons in the field 
of stationary nuclei. This was the starting point of Born and 
Heisenberg's calculations [36]. 

Let us now move to quantum theory and recast (18) as an 
operator relation, writing the molecular Hamiltonian operator 



H(x,p,X,P) = # (x,p,X) + K 4 #!(P) 
with equation of motion under H 



dX 
ih—— = 
dt 



[X,HJ = 



(20) 



(21) 



from which we infer the nuclear position operators X are 
constants of the motion under H„. We no longer make the 
interpretation that follows from (19) since specifying precisely 
the positions {X} for stationary nuclei violates the Uncertainty 
Principle. Instead (21) leads to a completely different conclu- 
sion (see below). 

We must now take a little bit of care about the definition of 
the variables, and dispose of the uninteresting overall motion 
of the molecule [4]. Since the Coulomb interaction only 
depends on interparticle distances it is translation invariant, 
and therefore the total momentum operator P 

P = Ep« 

n 

commutes with H. It follows that the molecular Hamiltonian 
may be written as a direct integral 



where [52] 



H = J r3 H(P) dP 



H(P) = + H' 

V ' 2M T 



(22) 



(23) 
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is the Hamiltonian at fixed total momentum P and Mj is the 
molecular mass. The internal Hamiltonian H' is independent 
of the centre-of-mass variables and is explicitly translation 
invariant. The form of H' is not uniquely fixed but whatever 
coordinates are chosen the essential point is that it is always 
the same operator specified in (23) acting on a Hilbert space 
f) that may be parameterized by functions of the electron and 
nuclear coordinates. 

The separation of the centre-of-mass terms from the internal 
Hamiltonian is the same in quantum mechanics as in clas- 
sical mechanics so we need not distinguish operators from 
classical variables in this step. It is convenient to choose the 
centre-of-nuclear mass for the definition of suitable internal 
coordinates 13 . Let t e be a set of internal electronic coordinates 
defined as the original electronic coordinates x referred to the 
centre-of-nuclear mass, and let t n be a set of internal nuclear 
coordinates constructed purely from the original nuclear co- 
ordinates X. If there are s electrons and M nuclei, there are 
s internal electronic coordinates, and M — 1 internal nuclear 
coordinates. There are corresponding canonically conjugate 
internal momentum variables. In terms of these variables the 
total kinetic energy of all the particles can be decomposed into 
the form 



T, 



CM 



T e 



(24) 



where Tq^ is the kinetic energy for the centre-of-mass, T^ 
is the kinetic energy for the nuclei expressed purely in terms of 
the internal nuclear momentum variables, and T e is the kinetic 
energy for the electrons expressed purely in terms of the 
internal electronic momentum variables. The Coulomb energy 
can be expressed purely in terms of the internal coordinates, 
U = f/(t e ,t n ). These relations are true both classically and 
in quantum mechanics with a suitable operator interpretation. 

In parallel with the decomposition in equation (18), we 
define the quantum mechanical 'electronic' Hamiltonian as 

jelec 



T e + U(f,t a ) 



(25) 



so that after dropping the uninteresting kinetic energy for the 
overall centre-of-mass, we see that the internal Hamiltonian 
has the form, 



H' = H 



elec 



T 



N 



(26) 



where, as before, the nuclear kinetic energy term is propor- 
tional to K 4 (see footnote 13). Its Schrodinger equation may 
be written 

H' \9 m ) = E m \9 m ) (27) 

where m is used to denote a set of quantum numbers 
(J M p r i): J and M for the angular momentum state: p 
specifying the parity of the state: r specifying the permuta- 
tionally allowed irreducible representations within the groups 
of identical particles, and i to specify a particular energy value. 
Any bound state (a 'molecule') has an energy lying below the 



13 It is always possible to split off the kinetic energy of the centre-of-mass 
without any approximation; with this choice we retain the separation of the 
electronic and nuclear kinetic energies as well, as in (24). Explicit formulae 
are given in e.g. [3] where it is shown that the nuclear kinetic energy terms 
involve reciprocals of the nuclear masses, so that overall, the nuclear kinetic 
energy is proportional to K 4 . 



start of the essential spectrum. 

Now just as in equation (21) H e l ec is independent of the 
nuclear momentum operators and so it commutes with the 
internal nuclear position operators 



[H elec ,t n ] 



= 0. 



(28) 



They may therefore be simultaneously diagonalized and we 
use this property to characterize the Hilbert space Jf? for 
H elec . Let b be' some eigenvalue of the t n corresponding to 
choices {x g = a g ,g = 1,. . .M} in the laboratory-fixed frame; 
then the {a g } describe a classical nuclear geometry. The set, 
X, of all b is R 3 ( M -!). We denote the Hamiltonian H elec eval- 
uated at the nuclear position eigenvalue b as K(b,t e )„ = K 
for short; this K„ is very like the usual clamped-nuclei Hamil- 
tonian but it is explicitly translationally invariant, and has an 
extra term, which is often called the Hughes-Eckart term, or 
the mass polarization term. Its Schrodinger equation is of the 
same form as (13), with eigenvalues E°(b)k and corresponding 
eigenfunctions (p(t e ,b)k, 



K <p(b,t e 



E°(b) k <p(b,t e 



(29) 



As before its spectrum in general contains a discrete part below 
a continuum, 



CT(b) = cr(K(b,t e ) ) 



£°(b)o,...£°(b) m |J 



A(b), 
! (30) 

Note that for other than diatomic molecules, it is not 
possible to proceed further and separate out explicitly the 
rotational motion. For any choice of b the eigenvalues of K 
will depend only upon the shape of the geometrical figure 
formed by the {a A ,}, being independent of its orientation. 
It is possible to introduce a so-called body-fixed frame by 
transforming to a new coordinate system built out of the 
b consisting of three angular variables and 3M — 6 internal 
coordinates. In so doing however one cannot avoid angular 
momentum terms arising which couple the electronic and 
nuclear variables, and so there is no longer a clean separation 
of the kinetic energy into an electronic and a nuclear part. 
Moreover no single specification of body-fixed coordinates can 
be given that describes all possible nuclear configurations. 

The internal molecular Hamiltonian H' in (23) and the 
clamped-nuclei like operator K just defined can be shown to 
be essentially self-adjoint (on their respective Hilbert spaces) 
by reference to the Kato-Rellich theorem [53] because in both 
cases there are kinetic energy operators that dominate the 
(singular) Coulomb interaction; they therefore have a complete 
set of eigenfunctions. As regards H e ^ ec , we have a family 
of Hilbert spaces {Jf? (b)} which are parameterized by the 
nuclear position vectors b G X that are the 'eigenspaces' of 
the family of self-adjoint operators K„; from them we can 
construct a big Hilbert space as a direct integral over all the 
b values 

Jif = j JT(b) db (31) 

and this is the Hilbert space for H elec in (25). 

Equation (31) leads directly to a fundamental result; since 
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pelec commute s with all the {t 11 }, it has the direct integral 
decomposition 

H elec = j^k(b,i e ) db. (32) 

Even if the ' clamped-nuclei' Hamiltonian has a set of discrete 
states - Potential Energy Surfaces - equation (32) implies that 
the unperturbed Hamiltonian 14 , H e ^ ec , has purely continuous 
spectrum (cf the Appendix), 

a = a(H elec ) = |J a(b) ee [VoH 

b 

where Vo is the minimum value of E(b)o', in the diatomic 
molecule case this is the minimum value of the usual ground- 
state potential energy curve Eq{t). The operator H e l ec has no 
localized eigenfunctions; rather, its eigenfunctions are contin- 
uum functions. To avoid any misunderstanding, we emphasize 
that this result has nothing to do with the continuous spectrum 
of the full molecular Hamiltonian associated with the centre- 
of-mass motion which can be dealt with trivially in the 
preliminaries. 

A possibly helpful way to think about this paradoxical result 
is as follows. The quantum mechanical molecular Hamiltonian 
for a collection of electrons and nuclei with Coulomb interac- 
tions is a function of position and momentum operators for all 
the specified electrons and all the nuclei. If now we separate 
off the terms containing all the nuclear momentum operators 
(the terms proportional to K A ) what is left must be a function 
of position and momentum operators for the electrons and 
position operators for all the nuclei. This statement is true in 
any representation of the operators, and in particular must be 
respected if one chooses a position representation. 

This is not what Born and Oppenheimer assumed about 
their equation (12) [our equation(lO)] when K = - see 
§III-A above - and which has been assumed ever since in 
Quantum Chemistry. In effect they chose to work only in 
the 'small' Hilbert space of a fixed configuration, Jf(X), in 
which X can be assumed to be a 'parameter' in the position 
space wavefunction y/(x,X), whereas if they had continued 
with quantum mechanics they would have been working in 
the 'big' Hilbert space Jf with x and X treated on an equal 
footing as operators, and all possible nuclear configurations 
being treated simultaneously, rather than one at a time. 

The unusual properties of the ('electronic') Hamiltonian 
H„(x,p,X) = hP ec in equation (32) 15 considered as a 
quantum-mechanical operator on the whole space Sj, are of 
exactly the kind to be expected from the work of Kato [54]. 
In Lemma 4 of his paper he showed that for a Coulomb 
potential U and for any function / in the domain S^q of the 
full kinetic energy operator To, the domain, of the internal 
Hamiltonian H' contains and there are two constants a, b 
such that 

||f//||<a||f /||+fo||/|| 
where a can be taken as small as is liked. This result is 

14 After the elimination of the centre-of-mass variables H e l ec is playing 
the role of H in (20). 

15 We assume that the centre-of-mass contributions are eliminated as usual. 



often summarised by saying that the Coulomb potential is 
small compared to the kinetic energy. Given this result he 
proved in Lemma 5 (the Kato-Rellich theorem) that the usual 
Coulomb Hamiltonian operator is essentially self-adjoint and 
so is guaranteed a complete set of eigenfunctions, and is 
bounded from below. 

In the present context the important point to note is that 
the Coulomb term is small only in comparison with the 
kinetic energy term involving the same set of variables. So 
the absence of one or more kinetic energy terms from the 
Hamiltonian may mean that the Coulomb potential term cannot 
be treated as small. It is evident that one can't use the 
Kato-Rellich argument to guarantee self-adjointness for the 
customary representation of H e ^ ec in a position representa- 
tion as a differential and multiplicative operator because it 
contains the nuclear positions {X} in Coulomb terms that 
are not dominated by corresponding kinetic energy operators 
involving the conjugate momentum operators {— itiV} since 
they have been separated off into the 'perturbation' term « jc 4 . 
As a quite separate matter, the abstract direct integral operator 
(32) is self-adjoint since the resolvent of the clamped-nuclei 
Hamiltonian is integrable. This is demonstrated in Theorem 
XIII. 85 in the book by Reed and Simon [53]. It is in this 
form that the operator is used in the mathematically rigorous 
accounts (to be discussed later) of the Born-Oppenheimer 
approximation in [64] and [70]. The operator used in the 
standard account of Born and Huang [44] is however simply 
the usual one which, as discussed above, is not self-adjoint in 
the Kato sense. 



D. Approximate calculations 



It might have been hoped, in the light of the claim in the 
original paper by Born and Oppenheimer quoted in §III-A, 
that the eigensolutions of the K — > limit of the internal 
Hamiltonian, H', would actually be those that would have been 
obtained from (10) after separation of the centre-of-mass term, 
by letting the nuclear masses increase without limit. Although 
there are no analytically solved molecular problems, the work 
of Frolov [55] provides extremely accurate numerical solutions 
for a problem with two nuclei and a single electron. Frolov 
investigated what happens when the masses of one and then 
two of the nuclei increase without limit in his calculations. To 
appreciate his results, consider a system with two nuclei; the 
natural nuclear coordinate is the internuclear distance which 
will be denoted here simply as t. When needed to express the 
electron-nuclei attraction terms, x" is simply of the form 0C,t 
where a, is a signed ratio of the nuclear mass to the total 
nuclear mass; in the case of a homonuclear system (Xi~±j. 

The di-nuclear electronic Hamiltonian after the elimination 
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of the centre-of-mass contribution as described in §III-C is 



H elec (f,t) 



If- A 
2m ■ 



h 1 



-— y v 2 rt e ) 

'"■S 2(m 1 +m 2 ) i .j- 1 

,2 iV 

Ei + 



z 2 



4jMb£i\J15 + ait| It^ + M 



,5, Itf-t 



^ £ ~1 IN 



z t z 2 



unchanged. 

a? There are two quite distinct approaches to the solution of 

E V(ff ) • V(tj) the molecular Schrodinger equation (27) based on the formal 
theory reviewed in §III-C. Functions of the type (36) can 
be used as the basis of a Rayleigh-Ritz calculation being, 
hopefully, well-adapted to the construction of useful trial 
functions. Several different lines have been developed; in the 
adiabatic model the trial function is written as the continuous 
linear superposition 



R=\t 



(33) 



while the nuclear kinetic energy part is: 



f N<«) - -yU^r*" 3 -^"- (34) 

The full internal motion Hamiltonian for the three-particle 
system is then 



H'(t e ,t) = H elec (t e ,t) + T N (t). 



(35) 



which is of the same form as (26). 

It is seen from (34), that if only one nuclear mass increases 
without limit then the kinetic energy term in the nuclear vari- 
able remains in the full problem and so the Hamiltonian (35) 
remains essentially self-adjoint. Frolov's calculations showed 
that when one mass increased without limit (the atomic case), 
any discrete spectrum persisted but when two masses were 
allowed to increase without limit (the molecular case), the 
Hamiltonian ceased to be well-defined and this failure led to 
what he called adiabatic divergence in attempts to compute 
discrete eigenstates of (35). This divergence is discussed in 
some mathematical detail in the Appendix to Frolov [55]. It 
does not arise from the choice of a translationally invariant 
form for the electronic Hamiltonian; rather it is due to the lack 
of any kinetic energy term to dominate the Coulomb potential. 

To every solution of (29) there corresponds a function 



4>(t e ,t n ) ffl = <p(b,t% 5(t n - b) 



(36) 



in the (t e ,t n ) position representation which is a formal solu- 
tion, in the sense of distributions, of the Schrodinger equation 
for H elec . The energy, S" m (b) of the function (36) is indepen- 
dent of the orientation of the figure defined by the b, and 
is also unaltered by the parity operation b — > b, and 
by permutations of the labelling of any identical nuclei. <i> m 
however depends on the orientation of the body-fixed frame 
defined by the configuration b with respect to some space- 
fixed reference frame. Let the Euler angles relating these two 
frames be Q. so that 

4>(b),„ - *(b,Q) m 

in an obvious notation, so we have a continuous family of 
degenerate states. The dependence on orientation is eliminated 
by forming a continuous superposition through integration 
over the Euler angles with some weight function c(£2) 



dQ! c(Q') 4>(b,£2') 



Similarly one may form superpositions of the space-inverted 
and permuted states in order to form a new basis that displays 
the corresponding symmetries that leave the energy eigenvalue 



l P(t e ,t n ) m = JdbF(b) (p(b,t e ) ffl 5(t n - b) 

= F(t n ) <p(t n ,t e ) m (37) 

where the square-integrable weight factor F(t n ) may be deter- 
mined by reducing (27) to an effective Schrodinger equation 
for the nuclei in which F(t n ) appears as the eigenfunction 
[56]. 

If the {(p m } are chosen to be orthonormal we have 
(V m ) = J Jdf dt a |^(t e ,t n ),„| 2 = j dt n \F(t a )\ 2 

We may choose the weight factor F to be normalized, so that 
the state function is also normalized. On the other hand 

0F m |H'|*F m ) = J Jdf dt n Xn (A' *») 

= J dt a F(t n )* (A B F) (t n ) 

where we have defined the effective nuclear Hamiltonian 

(A m f) (t n ) - j df (p(t e ,t n ) m [A' <p(t e ,t n ) m F(t a ) 

The Rayleigh-Ritz quotient 

E[Wm] = <^|A'I^> 



(38) 



(39) 



(W (40) 

is stationary for those functions that are solutions of the 
effective nuclear 'Schrodinger equation' 

H m F x 



Ems F s 



(41) 



In particular, using the electronic ground state (po, the 
Rayleigh-Ritz quotient leads to an upper bound to the ground 
state energy Eq of H . Having set up the calculation with square 
integrable functions the approximate ground-state is naturally 
a discrete state; the discussion however yields no information 
about the bottom of the essential spectrum i.e. it does not 
prove the existence of a bound-state below the continuum. This 
calculation amounts to the diagonalization of the projection 
of H on the one-dimensional subspace spanned by ^o- in 
principle the subspace may be enlarged, and the accuracy 
thereby improved, by using the subspace spanned by a set 
of trial functions (^o^i , • • • ^m) of the form of (37). Such 
non-adiabatic calculations which make no use of a Potential 
Energy Surface are restricted to very small molecules. 

In practice the variational approach is implemented as fol- 
lows; a collection of energies E(bi) is found through standard 
quantum chemical computations for different geometries {b;} 
and fitted to produce a function V(t n ) that is treated as 
a potential energy contribution to the left-hand-side of the 
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Born equation (15), rather than (41), so the clamped-nuclei 
assumption enters in an essential way (see Appendix). With 
considerable computational effort it is possible to construct 
permutationally invariant energy surfaces for molecules with 
up to 10 nuclei [57]. Note however that if H' is separated as 
in (26), then it is H elec that appears in (39) rather than the 
clamped-nuclei Hamiltonian. 

Another generalization is to replace the unnormalizable 
delta function in (37) by a square integrable function; the 
relation 



, 3 

a \ i 



5 3 (x - y) = lim(-) 2 e -«(*-y) 2 = lim Zfl (x,y) 

a— >oo \ j a— 

suggests that one might consider trial wavefunctions 

V(fXf m CM = jdb F(b) (p(b,t e ),„ X a(t n ,b) 

for some suitably chosen parameter a. This is the basis of the 
molecular Generator Coordinate Method (GCM) which is a 
non-adiabatic formalism; as before the weight factor F (b) is 
determined by appeal to the Rayleigh-Ritz quotient, although 
part of its structure can be determined purely by symmetry 
arguments. In the GCM the effective Schrodinger equation 
for the weight function becomes an integral equation (the 
Hill-Wheeler equation) [45]. Again the trial function may be 
improved, in the sense of a variational calculation, by forming 
linear superpositions of the wavefunctions {vp GCM j.; this has 
been done for diatomic molecules for which a fairly complete 
GCM account has been developed [45], [58]. Usually however 
the dependence on the nuclear variables {t n } is not expressed 
through functions adapted to nuclear permutation symmetry, 
and the GCM weight functions are determined by molecular 
structure considerations. 

It should be noted here that <p(b,t e ) as a solution to the 
Schrodinger equation (29) where t n has been replaced by b, 
is defined only up to a phase factor of the form 

exp[;w(b)] 

w is any single-valued real function of the {b,} which can 
be different for different electronic states. The phase factor 
is only trivial in the absence of degeneracies. Specific phase 
choices may therefore be needed when tying this part to the 
nuclear part of the product wave function. It is only by making 
suitable phase choices that the electronic wave function is 
made a continuous function of the formal nuclear variables, b, 
and the complete product function, made single valued. This 
is the origin of the Berry phase in clamped-nuclei calcula- 
tions involving intersecting Potential Energy Surfaces; for a 
discussion of these matters see [59], [60]. It is worth noting 
explicitly that notions of molecular Berry phases and conical 
intersections of PE surfaces are tied to the clamped-nuclei 
viewpoint which introduces 'adiabatic parameters'. According 
to quantum mechanics the eigensolutions of (27) are single- 
valued functions by construction with arbitrary phases (rays) 
so one does not expect any Berry phase phenomena a priori. 

The rigorous mathematical analysis of the original pertur- 
bation approach proposed by Born and Oppenheimer [38] 
for a molecular Hamiltonian with Coulombic interactions was 



initiated by Combes and co-workers [61] - [64] with results for 
the diatomic molecule. Some properties of the operator H e ^ ec , 
(equation 32), seem to have been first discussed in this work. 
A perturbation expansion in powers of K leads to a singular 
perturbation problem because K is a coefficient of differential 
operators of the highest order in the problem; the resulting 
series expansion of the energy is an asymptotic series, closely 
related to the WKB approximation obtained by a semiclassical 
analysis of the effective Hamiltonian for the nuclear dynamics. 
This requires a more complete treatment than the adiabatic 
model using the partitioning technique to project the full 
Coulomb Hamiltonian, H', onto the adiabatic subspace. A 
normalized electronic eigenvector \(p(b)j) is associated with a 
projection operator by the usual correspondence 

P(b)j = \(p(b)j)((p(b)j\ 

In view of our earlier discussion of the 'big Hilbert space' Jf? , 
we can form a direct integral over all nuclear positions 



db 



to yield a projection operator on the adiabatic subspace. If we 
want to include m electronic levels we can form a direct sum 
of the contributing {Pj} 

m 

p = e ^ 

7=0 

This is an Hermitian projection operator and it, and its 
complement, Q, have the usual properties 

P + Q = 1, P 2 = P, Q 2 = Q, PQ = QP = 

Using these projection operators the original molecular 
Schrodinger equation 

H'|¥) = E\W) 

can be transformed into a pair of coupled equations 

PH'P\Y) + PH'QIX) = Ei\y/) (42) 

QH'P\W) + QH'Q\X) = E\\x) (43) 

where 

W) = \x) = 

Solving (43) for \%) 

1 



\X) 



El - QW'Q 

and substituting in (42) yields the usual Lowdin partitioned 
equation [65] 

1 



Ph'P + Ph'q 



El - QW'Q 



QU'P)\w) = E\W) (44) 



Further progress depends crucially on establishing the prop- 
erties of the energy dependent operator in (44). A detailed 
consideration of the diatomic molecule case can be found in 
[63], [64]. The main result is that (44) is a generalized version 
of the effective nuclear Schrodinger equation (41) in the adia- 
batic model, so it contains the nuclear kinetic energy operators 
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and an effective potential V. The microscope transformation 
used by Combes and Seiler [63] to give a rigorous version 
of the Born-Oppenheimer theory of a diatomic molecule is 
essentially a semiclassical theory. It is applicable if there is 
a minimum in the potential Vq = V(xq) associated with a 
particular configuration of the nuclei 16 that is deep enough for 
the lowest energy eigenstates to be localized about xo. One 
can look at these states with a 'microscope' with a certain 
resolving power that depends on Planck's constant. 

The microscope transformation produces a translation to 
make xo the origin of the coordinates, and a dilation (scale 
transformation) 



with finite masses m& and mg respectively. On the other hand 
the spectrum of the electronic Hamiltonian, H e ^ ec , is purely 
continuous, a(H e ^ ec ) = [Vo,°°)- In the limit m^mg — > °°, E 
does not generally converge to Vo; instead the missing part 
of the continuous spectrum [Vq, Xa(°°) + Xb(°°)] is provided 
by an accumulation of bound states in this interval [62]. 
The microscope transformation is formally applicable to the 
polyatomic case but it may not be sufficient to control the 
asymptotic behaviour, and has not been used for general 
molecules. 



(1 -A)(x - x ), pa 



(1-A) 



P (45) 



It is readily verified that the commutation relations are pre- 
served for A ^ 



[*a,Pa] 



[x,pj 



Under this transformation a Hamiltonian of the form 

+ V(x) 



H = 



becomes 



where 



N(X) 



H x = V(x ) + X 2 N(X) 



h 



v 2 

5 4 2m,, 



^ (V(x + A(x - x )) 



We now put A = sj% so as to eliminate A from the 
kinetic energy term in N(X); with this choice for A, unitary 
equivalence of the spectrum implies that the eigenvalues of 
the original Hamiltonian H are related to those of N(X) by 

E n = V(x ) + hHn(k) 

Provided V is analytic in A it can be expanded about A = 0, 
and this puts N(X), in lowest order, into the form of a sum 
of coupled oscillators so that the first approximation for the 
eigenvalue function \i n is 



L(»* + l) 



In the Born-Oppenheimer calculation for the diatomic 
molecule the potential V is identified with the effective poten- 
tial for the nuclei [63]; analyticity of V in A could be proven, 
and the role of sj% was taken by the usual BO expansion 
parameter K = (m e /Mff)^. In this way the molecular energy 
level formula (12) is recovered as an asymptotic expansion. 

The singular nature of the microscope transformation for 
A = is demonstrated by the modification of the spectrum 
associated with the limit K — > 0. The spectrum of the Coulomb 
Hamiltonian for a molecule can be discussed in terms of 
the Hunziker-van Winter-Zhislin theorem [66] -[68]; for the 
diatomic molecule, O ess (n) starts at the lowest two-body 
threshold E = Xa^a) + Ab(otb) given by the minimal value 
of the sums of pairs of binding energies for atoms A and B 

16 The multiminima case can also be treated in this way. 



Since the initial work of Combes, a considerable amount 
of mathematical work has been published using both time- 
independent and time-dependent techniques with develop- 
ments for the polyatomic case; for a recent review of rigorous 
results about the separation of electronic and nuclear motions 
see Hagedorn and Joye [69] which covers the literature to 
2006. The Hamiltonian (26) is the one used by Klein et al. [70] 
in their consideration of the precise formulation of the Born- 
Oppenheimer approximation for polyatomic systems. Their 
work was based on a powerful symbolic operator method, 
the pseudodifferential calculus [70], [71] and a formalism 
related to the partitioning technique described above. In [70] 
it is assumed that (26) has a discrete eigenvalue which has a 
minimum as a function of the t n in the neighborhood of some 
values t" = b ( . If it can be assumed that a) the electronic 
wavefunction vanishes strongly outside a region close to a 
particular nuclear geometry and b) that the electronic energy at 
V(x ))the given geometry is an isolated minimum, then it is possible 
to present a rigorous account of the separation of electronic 
and nuclear motion which corresponds in some measure to the 
original Born-Oppenheimer treatment. 

A novel feature arises from the requirement that the in- 
version symmetry of the original problem be respected. If 
the geometry at the minimum energy configuration is either 
linear or planar then inversion can be dealt with in terms of 
a single minimum in the electronic energy. If the geometry 
at the minimum is other than these two forms, inversion 
produces a second potential minimum and the problem must 
be dealt with as a two-minimum problem; then extra consid- 
eration is necessary to establish whether the two wells have 
negligible interaction so that only one of the wells need be 
considered for the nuclear motion. The nuclei are treated as 
distinguishable particles that can be numbered uniquely. The 
symmetry requirements on the total wavefunction that would 
arise from the invariance of the Hamiltonian operator under the 
permutation of identical nuclei are not considered. Because of 
the continuous spectrum of the electronic Hamiltonian H e ^ ec , 
it is not possible to use regular perturbation theory in the anal- 
ysis; instead asymptotic expansion theory is used so that the 
result has essentially the character of a WKB approximation 
[70]. Similar functional analytic techniques have been used to 
consider such phenomena as Landau-Zener crossing by using 
a time-dependent approach to the problem and looking at the 
relations between the electronic and nuclear parts of a wave- 
packet [72]. This is essentially a use of standard coherent state 
theory where again the nuclei are treated as distinguishable 
particles and the method is that of asymptotic expansion. 
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IV. Discussion 

Quite generally one needs to make a distinction between 
an hypothetical Isolated Molecule, and a real observable 
individual molecule. There are no strictly 'isolated' systems 
of course, but what is striking is that an approach based on 
the stationary state eigensolutions of the appropriate Coulomb 
Hamiltonian works so well for atoms and diatomic molecules, 
but fails with three or more nuclei. We have always been clear 
that for most of chemistry, molecular eigenstates ('stationary 
states') are of no relevance since metastability is an essential 
aspect of isomerism. The interesting question is how to get 
from the quantum theory of an Isolated Molecule to a quantum 
theory of an individual molecule by rational mathematics. It is 
as well to remember that the generic molecule is sufficiently 
complex that the quantum mechanical permutation symmetry 
of identical nuclei is a feature that cannot be ignored, if 
one is doing quantum mechanics. The Isolated Molecule 
model doesn't capture isomerism, nor optical activity. We 
see no reason at all for Lowdin's optimistic assertion (§1) 
that molecular symmetry must be contained somehow in the 
Coulomb Hamiltonian. 

If a molecule is not isolated it must be interacting with 
something; that something is loosely referred to as the 'en- 
vironment'. It might be other molecules, the (macroscopic) 
substance the molecule finds itself in, or quantized electro- 
magnetic radiation. Blackbody radiation is all pervasive and 
charges are always coupled to the photon vacuum state in 
QED and so 'dressed' with clouds of virtual photons. A crucial 
feature of 'environments' in quantum theory is that generally 
they are described by Hamiltonians with purely continuous 
spectra. This is important because a quantum system with a 
finite number, n, of degrees of freedom described by the usual 
linear Schrodinger equation does not yield 'broken symmetry' 
solutions if n < °°. Such matters were discussed at length 
thirty years ago in the context of molecular structure and 
quantum theory [73] - [75]. The characteristic feature of such 
discussions, and this also applies to more modern formulations 
under the chic heading of 'decoherence', is that they start with 
some primitive notion of structure built in: two-state systems, 
potential energy wells, wavefunctions associated with distinct 
isomers etc.. The 'environment' is modelled in the simplest 
possible way (for example, a free boson quantum field). These 
crucial ideas are put in by hand at the outset. We don't see 
that as a 'problem' or 'difficulty'; it is a characteristic feature 
of many-body physics (condensed matter, nuclei, chemistry) 
and results in remarkably powerful and fruitful theoretical 
formalisms; see, for example, Anderson's discussion of what 
he calls 'adiabatic continuity' [76]. But one can hardly avoid 
noticing that the models of molecules used are caricatures that 
contain just the right features for the answer (for example, 
loss of permutation symmetry, loss of parity - in the case of 
chirality) and have no clear connection to Lowdin's's Coulomb 
Hamiltonian. 

An alternative account that is based on the Coulomb 
Hamiltonian may however be possible in the light of the 
fact that each part of it in the division made in (26) has a 
completely continuous spectrum. As noted in §III-D the formal 



eigenvectors of H e l ec from the ground state up can exhibit 
extensive degeneracy. It might be that 'broken symmetry' so- 
lutions corresponding to molecular structure could result from 
treating the two parts as asymptotic states in a scattering or 
reaction process in a manner analogous to that used in standard 
S -matrix theory. Such a state would be characterised as a 
'resonance' and would have to be long-lived to be describable 
as a molecule. Only a true bound state, of infinite lifetime, 
such as a bound eigenstate of the molecular Hamiltonian is 
really independent of how it was formed; resonant states have 
histories that describe the environment of their preparation. 
The Potential Energy Surface would then appear only as an 
auxiliary concept through the involvement of the clamped- 
nuclei Hamiltonian in the construction of the states {<p,„} 
required for equation (36). That however remains subject to 
further investigation. 

When Bohr introduced his quantum theory of the Ruther- 
ford atomic model of the hydrogen atom he made a drastic 
change in the status of electrodynamics. Hitherto, it had 
been understood 17 that charged particles affected, and were 
affected by, the electromagnetic field, and that was the root 
cause of the failure of a dynamical classical atom ('radiation 
damping' is a strong coupling interaction). Bohr relegated 
the electromagnetic field to a perturbation theoretic - weak 
coupling - role; the charges interacted among themselves 
according to Coulomb's law, to be treated as a strong coupling 
situation, and would exist permanently in the stationary states 
selected by the quantization conditions unless perturbed by an 
'external' electromagnetic field which produced 'transitions'. 
That perturbation theory viewpoint was maintained when 
quantum theory was applied to the atom and the electromag- 
netic field, and largely survives to this day, to the extent that 
the electromagnetic field is frequently regarded as a classical 
system. Such a spectroscopic viewpoint is not appropriate 
in the present context; quantum electrodynamics teaches us 
that there is no strict separation of charged particles and the 
(quantized) electromagnetic field, not least because of the 
requirements of gauge invariance. 

The difficulty with quantum mechanical perturbation theory 
for the interaction of atomic/molecular systems with radiation 
is this: the spectrum of the unperturbed atom/molecule consists 
of a continuum corresponding to the half-axis [E, °°) for some 
E < 0, and discrete energy levels Eq,E\,... below the 
continuum, that is Eq < E\ <, . . . , < E [66] - [68]. The spec- 
trum of the free electromagnetic field Hamiltonian consists 
of a simple eigenvalue at 0, corresponding to the vacuum 
state an d absolutely continuous spectrum on the half- 
axis [0,°°). This means that when coupling between particles 
and radiation is admitted, all the discrete energy levels of the 
atomic system including Eq become thresholds of continuous 
spectra; a quantum theory of matter and the electromagnetic 
field therefore requires the perturbation theory of continuous 
spectra. The quantum mechanical theory of electrons and 
nuclei interacting with quantized radiation in the low-energy 
regime is an active area of research in mathematical physics 
concerned with the stability of matter, the existence of the 

17 The Lorentz Theory of the electron for example [77]. 
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thermodynamic limit etc., but with no particular reference to 
features of chemical interest [78] - [82]. 

The presentation of a presumed exact bound state solution 
of the Schrodinger Coulomb Hamiltonian as a product of 
electronic and nuclear factors has been considered both by 
Hunter [83] and, more recently, by Gross et al. [84]. For 
present purposes the Hunter approach will be employed on 
the translationally invariant form of the internal Hamiltonian, 
given earlier (in §III-C). Were the exact solution known, 
Hunter argues that it could be written in the form 

V/(t n ,t e ) = Z (t")0(t n ,t e ) (46) 

with the nuclear wave function defined by means of 

| Z (t n )| 2 - J y/(t n ,tTv(t n ,t e )</t e (47) 

Providing the function j(t n ) has no nodes, 18 an 'exact' 
electronic wavefunction could be constructed as 

Vf(t n t e ) 

if the normalization choice 

J ^(t",t e )>(t n ,t e )#=l 

is made. The electronic wavefunction (48) is then properly 
defined, and a 'Potential Energy Surface' could be defined in 
terms of it by integrating out the electronic variables in the 
expectation value of the internal Hamiltonian in the state 0, 

U(t n ) = J <Kt n ,tTH'(t n ,t e )<Kt n ,t e )rft e (49) 

The nuclear function % is evidently quite different [86] from 
the usual approximate nuclear wavefunctions for vibrationally 
excited states which do have nodes. 

Although no closed solutions to the full problem are known 
for a molecule, some extremely good approximate solutions 
have been obtained for excited vibrational states of H2; Czub 
and Wolniewicz [87] took such an accurate approximation for 
an excited vibrational state in the J — rotational state of H2 
and computed U(/?). They found strong spikes in the potential 
close to two positions at which the usual vibrational wave 
function would have nodes. To quote [87] 

This destroys completely the concept of a single 
internuclear potential in diatomic molecules because 
it is not possible to introduce on the basis of non- 
adiabatic potentials a single, approximate, mean 
potential that would describe well more than one 
vibrational level. 

It is obvious that in the case of rotations the situation 

is even more complex. 
Wilson suggested [88] that using the clamped-nuclei Hamil- 
tonian instead of the full one in (49) to define the potential 
might avoid the spikes but Hunter in [86] showed why this 
was unlikely to be the case, and Cassam-Chenai [89] repeated 
the work of Czub and Wolniewicz using a clamped-nuclei 

18 A similar requirement must be placed on the denominator in equation 
(12) of [85] for the equation to provide a secure definition 



electronic Hamiltonian and showed that exactly the same spiky 
behaviour occurred. 

Another approach to this problem is in [84]; there is reason 
to believe however that this sort of difficulty is bound to 
arise whatever the approach. To see this, simply rewrite (46) 
to recognise that the exact states will actually have definite 
quantum numbers according to their symmetry, so that it would 
be more realistic to write 

VjMpri(t n ,t e ) = XJMPri(t a )hM P ri(t n ,t e ) (50) 

In the H2 study cited the first four quantum numbers are of 
no relevance, only ;' remains and here i labels the vibrational 
states. There is thus every reason to expect that the best that 
can be done from this approach is a distinct PES for each 
nuclear-motion state. 

This anticipated behaviour seems to be confirmed in very 
accurate calculations on H2 [90] for the electronic £ ground 
state of the molecule assumed to dissociate into two hydrogen 
atoms in their ground states. That work shows that, for 
example, the J — state supports just 14 vibrational states 
while the J — 15 state supports 10 and the J = 31 supports only 
1 state. Of course in a diatomic molecule, states of different 
k are states which differ in the electronic angular momentum 
and these results cannot be regarded as typifying the results 
for a polyatomic system. However work on H^ shows that 
in the case of J = there are 1280 vibrational states below 
dissociation [91] and that 46 is the highest value of J for which 
at least one vibrational state exists [92]. At this level then it 
cannot be assumed that the potential surface calculated in the 
usual way is an approximation to anything exact. 

The eigenstates of the full molecular Hamiltonian (the 
Coulomb Hamiltonian for the electrons and nuclei specified 
by a chemical formula) - a theory of an Isolated Molecule 
modeled on the quantum theory of the atom which we call 
the Isolated Molecule model - are reasonably well understood 
and might have some utility in a limited area of high-resolution 
experiments on very small molecules where questions of iso- 
merism do not arise [93]. Their computation poses formidable 
problems, and really belongs to few-body physics. If it is to 
be taken as underlying Quantum Chemistry then it is worth 
exploring the consequences of the model without regard to 
approximations made for practical utility which are a quite 
separate matter. In this paper we have attempted to discuss 
the Born-Oppenheimer and Born approaches to the quantum 
theory of molecules in terms first set out by Combes [61]. 
The essential point is that the decomposition of the molecular 
Hamiltonian (with centre-of-mass contribution removed) into 
the nuclear kinetic energy, proportional to K 4 and a remainder, 
is specified by equation (26), not by (9), or in other words, 
equation (9) cannot be written with an — sign if the conven- 
tional interpretation of the X acting as parameters is made. 
Allowing the nuclear masses to increase without limit in H e l ec 
does not produce an operator with a discrete spectrum since 
this would just cause the mass polarisation term to vanish and 
the effective electronic mass to become the rest mass. As we 
have seen it leads to 'adiabatic divergence' [55]. 

It is thus not possible to reduce the molecular Schrodinger 
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equation to a system of coupled differential equations of 
classical type for nuclei moving on Potential Energy Surfaces 
as suggested by Born. An extra choice of fixed nuclear 
positions must be made to give any discrete spectrum and 
normalizable L 2 eigenfunctions. In our view this choice, that 
is, the introduction of the clamped-nuclei Hamiltonian into 
the molecular theory as in §III-A, is the essence of what is 
commonly meant by the expression 19 , the 'Born-Oppenheimer 
approximation' 



H 



elec 



/ K(b,t e ) 
Jx 



db -> K(b,t e ) 



(51) 



If the molecular Hamiltonian H were classical as in [36], the 
removal of the nuclear kinetic energy terms would indeed leave 
a Hamiltonian representing the electronic motion for stationary 
nuclei, as claimed by Born and Oppenheimer [38], [46]. As we 
have seen, quantization of H changes the situation drastically, 
so an implicit appeal to the classical limit for the nuclei 
is required. The argument is a subtle one, for subsequently, 
once the classical energy surface has emerged, the nuclei 
are treated as quantum particles for the determination of 
the vibration-rotation spectrum (though indistinguishability is 
rarely carried through); this can be seen from the complexity 
of the mathematical account given by Klein and co-workers 
[70]. 

This qualitative modification of the internal Hamiltonian, 
the extra choice of fixed nuclear positions in the 'electronic' 
Hamiltonian, is ad hoc in the same sense that Bohr's quantum 
theory of the atom is an ad hoc modification of classical 
mechanics. An essential feature of the answer is put in by 
hand. We know that both modifications have been tremen- 
dously useful and our point is not that something else must be 
done in practical calculations on molecules. The point is how 
the successful description of molecules involving the clamped- 
nuclei modification at some stage can best be understood in 
terms of quantum mechanics. In the case of the Bohr atom 
the resolution of the inconsistency in mechanics applied to 
the microscopic realm was achieved quite quickly with the 
formulation of quantum mechanics; in the molecular case, no 
such resolution is at present known. 

Appendix 

In this appendix we give an heuristic account of the math- 
ematical notion of the direct integral of Hilbert spaces, and 
then study a simple model problem to illustrate the general 
ideas discussed in the paper. 

Consider a self-adjoint operator T that depends on a param- 
eter X, so T = T(X). The parameter X = —°°<X < +°°, 
covers the whole real line and T(X) is assumed to be 
defined for all X. T(X) is an operator on a Hilbert space, 
which is denoted Jt?(X); its eigenfunctions {0} defined by 

T(X) <j)(X) k = X k {X) <j)(X) k . 

form a complete orthogonal set for the space Jf?(X). The 



"in its original form b = b„, the equilibrium configuration, on the right- 
hand side of (51). 



scalar product is 

(<j>(X) k \<j>(X)j) x = f <j>(X,x)* k <i>(X,x)jdx = f(X) kj = f(X) 8 kj 

(52) 

T may have discrete eigenvalues below a continuous spectrum 
that starts at A, that is 

a{X) = a(T(X)) = [h(X)MX),-M*)) U ^),»)- 

Now let's introduce a 'big' Hilbert space if as a direct 
integral over the {Jf(X)}, 

Jf = f° Jff(X) dX (53) 
and correspondingly the operator acting on iff defined by 

r® 

ST = / T(X) dX. 

J0g 

The scalar product on the big space is defined explicitly 
in terms of (52) by 

tf(X)M(X)j)je := ( (<l>(X) k \(l>(X)j) x dX < - (54) 

In equation (52) one can always chose the functions {<p k } to 
be orthonormalized independently of X, 

/(*) = 1 

However this choice is not consistent with (54), which requires 
f(X) to decrease sufficiently fast as \X\ — > °° for the integral 
to exist. The mathematical motivation for this construction is 
this: the cartesian product of the spaces {Jf(X)}, 

& = n ^(*) 

XeM 

is a field of Hilbert spaces over 8% which has a natural vector 
space structure. In modern geometric language, the Hilbert 
space Jf(X) is a fibre over a point X in a fibre bundle; & 
is the vector space of sections of this bundle. The subspace 
of & consisting of square integrable sections is the direct 
integral (53). The direct integral is the generalization to the 
continuous infinite dimensional case of the notion of the direct 
sum of finite dimensional vector spaces. 

Purely as a heuristic explanation suppose initially that the 
parameter X has only two discrete values {Xi,X2}. There are 
then two eigenvalue equations to consider, and two associated 
Hilbert spaces. In the direct sum space we have 

^2 = T(Xi) e t(x 2 ), jt = e Jt?{x 2 ) 

The eigenfunctions of 2Ti are then two-dimensional column 
vectors, and so 



^2 



(A,(X0 + Xj(X 2 )) 



<P(Xi) k 



The spectrum of 3*2 is the union of the spectra of T(X{) and 
T{X2). This discussion is trivially extended to n points {X k : 
k = l,...,n}, with the spectrum of ,% given by the union 
of the n operators T(X\ ),..., T(X n ). The limit n — > °° is not 
trivial since it brings in important notions from topology and 
integration (measure theory) which we gloss over [94]. When 
these are taken into consideration however the result is that the 
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spectrum a of is purely continuous since its direct integral 
representation implies that its spectrum is the union of the 
spectra of the infinite set of T(X) operators, 

a = a{J) = [J a(X) = [L ,~) (55) 
x 

where Lq is the minimum value of Ao(X). The eigenvalue 
equation for 2F is, 

2? 4>a = A 4>a, Lq < A < °o 

with A a continuous index for the {<$>}. Even if r(X) is self- 
adjoint, it doesn't follow that its direct integral 3~ is self- 
adjoint; that depends on specifics and has to be investigated. 
So the {4>} cannot be assumed to be complete. 

In the application of this mathematics to the Born- 
Oppenheimer approximation, the role of x is taken by the 
electronic coordinates t e , and X is to be identified with definite 
choices of the nuclear coordinates b. If there are M nuclei the 
parameters b are elements of ^ 3 ( M_1 ). The operator T(X) 
is the clamped-nuclei Hamiltonian K(b,t e )„ = K . With 
the conventional normalization of clamped-nuclei electronic 
eigenfunctions independent of the nuclear positions b, the 
formal eigenvectors, (36), [56] of H e l ec do not belong to the 
Hilbert space Jf; this simply reflects the loose use of the 
Dirac delta function for the position operator eigenfunctions. 

We now consider a concrete model consisting of coupled 
harmonic oscillators with two degrees of freedom; we try 
to mimic the steps taken in the usual Born-Oppenheimer 
discussion. Consider the following Hamiltonian where K and 
a are dimensionless constants 20 

H = p 2 + k 4 P 2 + x 2 + X 2 + axX (56) 

The only non-zero commutators of the operators are 

[x,p] = i [X,P] = i 

Following the conventional discussion of electron-nuclear sep- 
aration outlined in §III-A,§III-B, define 

H„ = f + x 2 + axX + X 2 (57) 
Hi = P 2 (58) 

so that 

H = H + k 4 Hi (59) 
with Schrodinger equation 

(A - E^V = (60) 

We note that 

[A ,Ai] ^ o (61) 

so the two parts cannot be simultaneously diagonalized. A 
principal axis transformation of the whole expression H brings 
it to separable form, but we do not need to pursue explicitly 
the full solution here. 
On the other hand 

[H ,X] = (62) 

20 The variables are expressed in dimensionless form for simplicity. The 
quantum oscillator h = j(p 2 + q 2 ) has eigenvalues n + \. 



so these two operators may be simultaneously diagonalized, 
and consider H„ at a definite eigenvalue of X, say X 

K = p 2 + x 2 + axX + X 2 (63) 

This is the Hamiltonian (in the variables x, p) of a dis- 
placed oscillator in which X is a (c-number) parameter, with 
Schrodinger equation in position representation 

K„ (p(x,X)„ = E n (X) (p{x,X) n (64) 

K is the analogue in this model of the ' clamped-nuclei' 
electronic Hamiltonian. 

The solution is immediate; we make a unitary transforma- 
tion by introducing a displaced coordinate involving X 

9! = x - \aX, p' = p (65) 
= e lflX P/ 2 , R' = Cr'KoO (66) 

so that the transformed K„ in the new variables is 

k' = p' 2 + x' 2 + (i - ^)x 2 

which has oscillator eigenvalues and eigenfunctions 

E n (X) = E° n + (1 - ^-)X 2 , (p(x% (67) 

where x' = x — the {<p n } are the usual harmonic oscillator 
eigenfunctions and E® is the energy of the free oscillator 
2(n + j). For fixed n the spectrum o(X) is discrete and, 
as a function of the X parameters, would be conventionally 
interpreted as a 'potential energy curve'. As far as (59) is 
concerned, K„, evaluated at some point Xq cannot be regarded 
as an 'approximation' to H„, since obviously 

[K o (X ),Hi] = 

so they can be simultaneously diagonalized, and H ] has purely 
continuous spectrum (free motion). So we have to consider X 
in the full problem in its operator form, X. 

We make the same unitary transformation of the x, p vari- 
ables in H„ as before, and it is still brought to diagonal form; 
however Hi will be modified because P is also translated by 
the operator in (66) that generates the coordinate displace- 
ment (cf equation (61)); thus 

Cr 1 p = p + \ap 

so the transform of H ] contains linear and quadratic terms in 
P- 

Nevertheless (62) is still valid, and formally we may write 
H as a direct integral 

H f , = / e K(x,p,X) dX (68) 

■IX 

The Schrodinger equation for H„ in position representation is 
now an equation involving functions of x and X 

H„ <b e {J,X) = e ® e (x>,X) (69) 

Just as before (see (55)) the direct integral decomposition (68) 
implies that the spectrum is purely continuous, explicitly 

a(H G ) = \Ja(X) = [l.oo). (70) 
x 
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e in (69) takes all positive values > 1, where 1 is the 
minimum value of the oscillator eigenvalue 2(n + j). The 
associated continuum eigenfunctions {4>} may formally be 
written as products of oscillator eigenfunctions (in x'), and 
delta functions (in X). They don't lie in Hilbert space of course 
and one needs the Gel'fand construction of a rigged space to 
make sense of the formal calculation [95]. If one returns to the 
x variable, the {<p„} are functions of x and X, since x' — x 1 (X). 

One can't do anything very useful with the direct integral 
expression (68) for H„ apart from adding it onto the K 4 term, 
which just returns us to the full problem. The full wavefunction 
in (60) can be expanded as 

V(x>,X) = t <P (*'(*')) S(X - X') c{X') n dXll) 
= £c(X)„ <p{x'(X)) n (72) 

n 

which obviously leads towards a variational approach [56]; 
such expansions rely on the completeness of the states em- 
ployed. In this simple problem there is no difficulty, but as 
noted earlier, in realistic Coulomb systems it is much less 
clear that a complete set of states is available. 

However that may be, let us rehearse again the argument due 
to Born summarized in §III-B. We substitute (72) in equation 
(60), left multiply by and integrate out the x' variables to 
leave an equation for the coefficients {c(X) n }, 



j dx> (p((x'(X))l 



H„ + 



K 4 H, 



£c(x)„ <p(( x '(x)) n 



= E fdx> 9({x f (X))* m £c(X)„ <p((x'(X)) n (73) 
j n 

At this point in the conventional account, H„ is replaced by 
K , equation (63), and then the action of K on the functions 
{<p} in equation (73) can be evaluated using (64) in the well- 
known way, 

H <p{x'{X)) n -> K <p(x'{Xj) n = e(X) n (p{x'(X)) n 

^From the foregoing discussion it is clear that the substitution 
of H by K makes a qualitative change in the theory. This 
change does seem to be the 'right' thing to do, but so far there 
is no explanation as to why this is so. 
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